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Abstract We examine the convexity and tractability of the two-sided linear chance con¬ 
straint model under Gaussian uncertainty. We show that these constraints can be applied 
directly to model a larger class of nonlinear chance constraints as well as provide a reason¬ 
able approximation for a challenging class of quadratic chance constraints of direct interest 
for applications in power systems. With a view towards practical computations, we develop 
a second-order cone outer approximation of the two-sided chance constraint with provably 
small approximation error. 
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1 Introduction 

Chance constraints (or probabilistic constraints) were among the first extensions proposed 
to linear programming as a natural formulation for treating constraints where some of the 
coefficients are uncertain at the time of optimization [S]. In the chance constraint model, we 
suppose that the uncertain values follow a known distribution and enforce that the constraint 
holds with high probability as a function of the decision variables. 

Nemirovski and Shapiro m observe that, in general, convexity and tractability results in 
chance constraints are a rare combination. When the corresponding deterministic constraint 
is convex, the chance constraint may be nonconvex. And even for those chance constraints 
which are in fact convex, the authors cite examples where such constraints remain 
computationally intractable because it is NP-Hard to test if the constraint is satisfied. For 
linear chance constraints of the form 

P(z T £ < b) > 1 - e, (1) 

where x £ R” and b £ R are decision variables, the constraint is known to be convex (that 
is, the set {{x,b) : P(x T £ < b) > 1 — e} is convex) and computationally tractable when £ 
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has an elliptical log-concave distribution nn. examples of which include the multivariate 
Gaussian distribution and few others. The computational challenges presented by chance 
constraints have motivated approximation schemes m and alternative formulations such 
as robust optimization [3]. 

Even more challenging than linear chance constraints, joint chance constraints require 
that a set of linear constraints hold jointly with high probability. Prekopa ITS] reviews many 
of the standard results. In particular, he proves convexity of the constraint P(cc > £) > 1 — e 
with respect to x £ R™ when £ follows a multivariate continuous log-concave distribution 
and of the constraint ¥(Tx > 0) > 1 — e when some elements of the matrix T are random 
with a joint Gaussian distribution and have a specialized covariance structure between the 
rows of T (further generalized by [TO]). Van Ackooij et al. [21; consider rectangular chance 
constraints of the form P(a <£<&)> 1 — e with respect to vectors a and b where £ follows a 
multivariate Gaussian distribution. Their model does not allow for products between random 
variables and decision variables. 

The basic model we consider in this work, which is a special case of a joint chance 
constraint, is the two-sided chance constraint 

P(a < £ T £ < b) > 1 — e, (2) 

where a £ R, b £ R, and x £ R" are decision variables, and £ is jointly Gaussian with 
known mean and covariance. In Section [3j we prove that this constraint is in fact convex in 
a, b and x given e < The proof, which we believe is the first, follows from a geometri¬ 
cal insight combined with standard tools for chance constraints such as log-concavity. The 
major methodological contributions of this work lie in the subsequent generalizations of the 
model and in our analysis of the computational tractability of the chance constraint. In 
Section [4] we show that a number of seemingly more complex and nonlinear constraints can 
be formulated by using the two-sided constraint ©■ In Section [5j we demonstrate compu¬ 
tational tractability of these constraints under a modern mathematical optimization lens. 
In addition to an exact derivative-based nonlinear formulation, we develop an approximate 
second-order cone (SOC) formulation for ([ 2 ]) with provable approximation quality. This SOC 
formulation permits one to incorporate such constraints into large-scale models solvable by 
state-of-the-art commercial and open-source software. 

Using © as a primitive, we develop an approximation for the more challenging chance 
constraint 

P((a T £ + b ) 2 + (c t £ + d) 2 < k) > 1 - e, (3) 

where a,c £ R ra , &, d, k £ R are decision variables, and £ is jointly Gaussian with known 
mean and covariance. This constraint is motivated by applications in power systems which 
we discuss in Section [2j In Section [bj we study the constraint © in detail and compare a 
number of approximation schemes, ultimately demonstrating that our approximation based 
on two-sided constraints is reasonable and of practical interest for its tractability. 


2 Motivation 

The basic question which motivates this work is the short-term planning problem, known as 
optimal power flow (OPF), which is solved as part of the real-time operation of the power 
grid to determine the minimum-cost production levels of controllable generators subject to 
reliably delivering electricity to customers across a large geographical area HUS]. Conceptu¬ 
ally, OPF is similar to a network flow problem with the additional complication that power 
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flows according to the nonlinear KirchhofT laws. On top of the nonlinear power flow laws, 
we aim to consider the uncertainty in production levels of renewable energy sources such as 
wind and solar photovoltaic. 

In its traditional, deterministic form, OPF seeks to minimize total production costs 


minimize 

P,e,f 


i&g 


(4) 


subject to the constraints 

£ fbn- ^2 f mb = ^ 2 Pi+Wb ~ db ’ € ( 5 ) 

n:{b,n}£jC m:{m,b}(zjC i€Gb 

P? in <Pi<P? ax , Vi eg, ( 6 ) 

fmn = fimn(Pm @ n)i V{?Ti,7r} G £, (7) 

~fmn < fmn < f™*, V{m, n} G C, (8) 

where B is the set of nodes (buses) in the grid, Q is the set of generators, G& is the set 
of generators located at node b , and £ is the set of edges (transmission lines). Decision 
variables pi denote the production levels of generator i, and the variables f mn denote the 
flow from node m to node n. The value db is the demand at each node (assumed to be 
known), and the value Wb is the forecast production level from renewable energy sources 
(again assumed to be known). Constraint ([5]) is the familiar flow balance constraint which 
balances supply with demand at each node. Constraints 0 and ^ enforce the capacities 
of the generators and transmission lines, respectively. The constraint 0 links the flows to 
the bus angles 6 and arises from the standard “DC” linearization of the nonlinear power 
flow laws; hence, this formulation is often called DCOPF. The formulation as stated above 
is efficiently solvable by linear programming on large-scale systems with tens of thousands 
of nodes within real-time operational constraints. 

Our motivation is to address two major deficiencies in the standard DCOPF model. The 
first major deficiency is the deterministic nature of the model. In particular, the amount of 
power generated by renewable energy sources such as wind is highly variable and must be 
accounted for in short-term planning. 

The line of work by [51H6] addresses this deficiency by introducing chance constraints. 
More specifically, Bienstock et al. [5] propose to model the deviations from the forecast wind 
production levels as zero-mean Gaussian random variables u>b, combined with a proportional 
response policy for the generators. Letting f2 be the total, real-time deviation from the fore¬ 
cast (a positive value if there is more renewable generation than expected), each generator 
has a proportional response coefficient a, and adjusts its real-time production to match 
Pi — aifi. If Yhi a i = 1) then this response policy guarantees balance of supply and demand, 
although it does not guarantee that output capacities or transmission capacities are always 
satisfied. Both pi and oti are decision varibles. Transmission capacities, in practice, are soft 
constraints, and hence [S] propose to enforce them as chance constraints 

ni/ m j<cr)>i-c (9) 

where f rnn is the random flow driven by the deviations u>b- Bienstock et al. [5] then approx¬ 
imate Q by splitting it into two constraints 

nf mn < fZT) > 1 - £ and P(f mn > ~f%?) > 1 ~ 


(10) 
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both of which can be expressed as simple linear Gaussian chance constraints ([T]). The 
assumption that deviations from the forecast follow a Gaussian distribution is made for 
tractability. This assumption can be further refined, with practical gain, without loss of 
tractability by introducing uncertainty sets on the parameters of the Gaussian distribu¬ 
tion jIB| . 

The second major deficiency in the standard DCOPF model is the crude approximation 
it provides of the true, nonlinear, nonconvex power flow laws. In particular, the linearized 
model assumes constant voltage and therefore neglects so-called reactive power flow, which 
is the imaginary component of complex-valued power flow. The real component is referred 
to as active power. Although we cannot directly treat the nonconvex case, we propose to 
consider more accurate linearizations which account for reactive power, such as those which 
arise from linearizing around a current operating solution [B|. When extending the model 
of [5] to account for reactive power, we obtain chance constraints of the form 

+ (. fZn tive ) 2 < (.cd 2 ) >i e, (ii) 


because transmission capacities are limited by the magnitude of the complex-valued power 
flow across a line. 

Our first attempt at studying the constraint © led us to study the simpler two-sided 
form ([2|. These results, in turn, provided us with a means to approximate (111, as we 
discuss in Section [6] The approximation we derive here has already yielded a practical 
implementation in the JuMPChance modeling package m which is being used to study the 
value of the model we propose in ongoing work m- 


3 Convexity of two-sided Gaussian linear chance constraints 

The main result in this section is the convexity of the two-sided chance constraint ([2]). 

Let ip(x) = b e the standard Gaussian density and <P(x) = f^^ipftjdt the 

Gaussian integral. 

Definition 1 Let £ ~ IV(0,1) be a standard Gaussian random variable. Let e £ (0,1). We 
define the set S e := {(x, y) £ R 2 : P(x < £ < y) > 1 — e}. 

Note that S e has two equivalent representations as {(x, y) : <p(t) dt > 1 — e} and {(x, y) : 

<P(y) — <P(x) > 1 — e}. 

We will proceed to prove that S f is convex, but first we define log-concavity and recall 
some basic properties. See Boyd [8] for further discussion and proofs of these properties. 

Definition 2 A non-negative function / : R” —> M is log-concave if V x, y £ dom / and A £ 

( 0 , 1 ) 

f(Xx+(l-X)y)>f(x) x f(y) 1 -\ 

For strictly positive functions /, this definition is equivalent to the condition that log / is 
concave. It is easy to verify, therefore, that the Gaussian density ip is log-concave. Lemma |T| 
recalls basic properties of log-concave functions. 

Lemma 1 The following properties hold for log-concave functions: 

— If f : R ra —► R and g : R" — > R are log-concave, then the product h(x) = f(x)g{x) is 
log-concave. 
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— If f : R" —> M is the indicator function of a convex set, then f is log-concave. 

~ Iff ■ R n x M m —x M is log concave, then g{x ) = f f{x,y)dy is log-concave on R™. 

Proof See Boyd 

With these basic properties, we can proceed to prove the following lemma. 
Lemma 2 The set S e is convex. 


Proof Let I(s,r) = 1 if s < r and zero otherwise. That is, I is the indicator function 
for the convex set {( a,b ) : a < b}. Therefore the function g(t,x,y) = <p(t)I(t,y)I(x,t) is 
log concave, because it is a product of log concave functions. Then for y > x,f(x,y) = 
f y ip(t) dt = J ip(t)I(t,y)I(x,t) dt is log concave, because it is the marginal of a log concave 
function. Hence S e is convex because it is an upper level set of a log-concave function. 


Convexity of S e proves convexity of the very simple chance constraint P(a; < £ < y) > 
1 — e for all e £ (0,1) with respect to (x, y) £ R 2 . Note that this convexity result is a special 
case of the rectangular constraints considered by [21] . In order to account for products 
between the decision variables and the random variables, we require the following additional 
developments. 


Definition 3 Let S e = cl {(x,y,z) : ( x/z,y/z ) £ S e ,z > 0} be the conic hull of S e (where 
cl is the closure operator). 

By standard results |12| . S f is convex. The following lemma, in which we prove mono¬ 
tonicity properties of the set S e , is key to our main result. 

Lemma 3 Let e £ (0, |]. Then (x, y,z) £ S e iff z > 0 and 3 x' > x, y' < y, and z' > z such 
that (x r , y', z') £ S e . 


Proof Suppose we are given (x',y',z') £ S e and ( x,y,z ) with x < x',y > y ', and 0 < z < 
z'. We will show that ( x,y,z ) £ S e . By symmetry of the Gaussian density and e < 
{x',y',z') £ S e implies x' < 0 and y' > 0, so x/z < x'/z < x' / z' and y/z > y'/z > y' / z'. 
By increasing the upper limit of integration or decreasing the lower limit of integration, we 
can only increase the value of the integral, so 



cj>(t) dt > 



f>(f) dt > 1 


— e. 


( 12 ) 


For the case of z = 0, take a sequence of decreasing iterates z i = z', Z 2 , £ 3 ,... with Zi —> 0. 
For each i, the above argument shows {x,y,zf) £ S e , which implies (x,y, 0) £ S e since S e is 
a closed set. 


With these properties, we now prove the main result of this section. 

Theorem 1 Let f be a vector of n i.i.d. standard Gaussian random variables, 0 < e < I 
and 

C:={(a,6,i)£lxRxI" : P(a < x T £ < b) > 1 — e} . 

Then C is a projection of the convex set 

{(a,6, x,t) £ 1 x R x R" x R : ||ac|I 2 < t, ( a,b,t ) £ , 


and hence C is convex. 
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Proof 

iff 

iff 

iff (by Lemma^fy 


P(a < x T f, < 6) > 1 - e 


a x 1 £ b 

— < 7 --^ < 7—7 


F 2 F 2 F 2 


> 1 - e 


(a, b, ||x|| 2 ) £ 


3 i > ||x|| 2 such that (a, b 1 1 ) £ SF 


(13) 

(14) 

(15) 

(16) 


Where the equivalence between (141 and (15) holds because 


P_L 


random variable. The above proof assumes x 0. For the case of x 


is a standard Gaussian 
= 0 , 


P(a < x T f, < 6) > 1-e 


(17) 


iff 

a < 0 < b 


iff 


(a, b , 0) £ S e . 


(18) 

(19) 


The justification for the final equivalence is as follows. If the strict inequality a < 0 < b 
holds, then lim z _ ) .o+ J^/z pit) dt = 1, so membership holds in S e . If a = 0, 6 = 0, or both, 
then we can construct a sequence of points (ai,bi,0) —> (a, 6,0) with each (aj,6j,0) £ S e , so 
the statement holds because S e is closed. 


More generally, 

Lemma 4 Let £ ~ N(p,£) be a jointly distributed Gaussian random vector with mean p 
and positive definite covariance matrix £ and 0 < e < |, and let 

:= {(a,b,x) £ I x I x R" : P(o < x T f, < b) > 1 - e} . 

Then C^z convex. 

Proof Let LL T = £ be the Cholesky decomposition of the covariance matrix £. Then £ = 
L( + p where f is a vector of i.i.d. standard Gaussian random variables. The point (a,b,x ) 


satisfies 

P(a < x T f, < 6) > 1 - e 

(20) 

iff 

P(a < x T (Lf + p) < 6) > 1 — e 

(21) 

iff 

(a — p T x, b — p T x, L T x) £ C. 

(22) 


That is, the set C^ : z is an affine transformation of the convex set C representing the i.i.d. 
case, and hence C^, t z is convex. 
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4 Exact extensions of two-sided constraints 

In this section, we generalize the basic result in Section [3] to a number of cases in which a 
seemingly more complex chance constraint can be represented exactly by using two-sided 
chance constraints. 


4.1 Nonlinear chance constraints 

The simplest nonlinear constraint we consider, which will be used in formulating the ap¬ 
proximation of the quadratic chance constraint in Section [6j is the absolute value constraint. 

Lemma 5 Let £ ~ N(fi,S) be a jointly distributed Gaussian random vector with mean g 
and positive definite covariance matrix £ and 0 < e < \. Then the set 

{(o, tjijelxRxK": P(|ce t £ + a\ < b) > 1 — e} (23) 


is convex. 


Proof P(|x T £ + a\ < b) > 1 — e iff P(— b — a < x T tj < b — a) > 1 — e. 

The above lemma is a special case of the following significantly more general theorem: 


Theorem 2 Let f : R. —> R. be a convex function which attains its minimum at x = c, 
let g : R m —> R be an arbitrary convex function, and let £ be a standard Gaussian random 
vector (without loss of generality, we can assume independence and zero mean). Let e < 
Then the set 

D ■= {(x, z, b) G x R m x R : P (f(x T £ + b) + g(z) < 0) > 1 - e} (24) 

is a projection of the convex set 

{{x, z, b, k, x', y', t) e K n+m + 5 : f > ||as|| 2 , k < — g(z), x' > l(k) — b — c, (25) 

y' < u{k) - 6- c, {x',y',t) e iSJ (26) 


where l and u, are explicitly computable convex and concave functions, respectively, which 
we define below depending on f. And hence, D is convex. 


Proof Let l(k) and u(k) be functions such that f(x — c) < k iff x £ [ l(k),u(k )]. We can 
obtain l and u by shifting the graph of f so that the minimum is at zero and then reflecting 
the graph along y = x, and since /(• — c) is decreasing up to zero and increasing after zero, 
we have in particular that u(k) is concave and increasing and l{k ) is convex and decreasing. 
Then 


iff 


iff 


P(/(cc t £ + &)+ 5 (z)<0)>1- 


P (l{—g(z)) < x T l ; + b + c < u(—g(z))) > 1 — e 


P(l(—g(z)) — b— c< x T $, < u(—g(z)) — b — c) > 1 — < 


(27) 


(28) 


iff (by Theorem^ 7]) 


(29) 
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Finally, 

x > l(—g(z)) — b — c and y < u(—g(z)) — b — c 
iff (by l decreasing and u increasing) 

3 k < —g(z) such that x' > l(k) — b — c and y' < u{k ) — b — c. 


(31) 


(32) 


Since l{k ) is convex andu(k) is concave, conditions (30) and (32) give a convex formulation, 
in an extended set of variables, for the chance constraint (24). 

Theorem [2] is sufficiently general to shed light on the quadratic chance constraint (11) 
which motivated our original work. If one of the terms in the chance constraint is deter¬ 
ministic, then the constraint is indeed convex, as the following lemma shows. This simpler 
form of the quadratic constraint itself can be useful for the motivating application in power 
systems, if, for example, the reactive power flow across a transmission line is not subject to 
randomness. 


Lemma 6 Let f ~ N(g,£) be a jointly distributed Gaussian random vector with mean g 
and positive definite covariance matrix £ and 0 < e < g. Then the set 

{{x, b, k, z) £ R n x M x R x M : P((z T £ + b) 2 + z 2 <k)> 1 - e} (33) 


is convex. 

Proof Set f{y) = y 2 , g(z, k) = z 2 — k and apply Theorem^ 

A similar proof technique as used in Theorem[2]can also be applied in other cases. In the 
following lemma, we demonstrate convexity of the quadratic chance constraint in another 
special case when the random variable is univariate. 

Lemma 7 Let f be a scalar standard Gaussian random variable and let e £ (0,1). Then the 
set 

{(&, d, k) £ M 3 : P((£ + b) 2 + (£ + d) 2 < k) > 1 - e} 

is convex. 

Proof By applying the quadratic formula, we see that (£ + b ) 2 + (£ + d) 2 < k iff £ £ 
[/ l(b,d,k),u(b,d,k )] where 

l{b, d, k ) = * (-(d + b) - \/2fc - (d - 6) 2 ) 

and 

u(b , d, k ) = - (d + b) + \/2k — (d - b) 2 ^ . 

By analogy with the proof of Theorem Q it suffices to show that l is convex and u is concave. 
To prove this, it suffices to show that k — (d — b) 2 is concave, which holds since \f- is 
concave increasing and 2k — (d — b) 2 is concave. Note we allow e > \ because this proof 
requires only monotonicity properties of S e , not S e . 

We also observe that when / and g in Theorem[2]are piecewise linear, e.g., as in Lemma[5] 
then we have demonstrated the convexity of a special family of joint linear chance con¬ 
straints. 
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4.2 Distributionally robust two-sided chance constraints 


So far we have left unquestioned the assumption that the parameters p and £ of the Gaus¬ 
sian distribution are known with certainty, when often they are subject to measurement 
error. For the case of linear chance constraints, Bienstock et al. [5] propose a tractable 
model that enforces robustness with respect to deviations of the parameters /j and £ within 
a known uncertainty set U. Lubin et al. f!6j implement this model and demonstrate signif¬ 
icant cost savings in the context of short-term operational planning of power systems when 
tested against out-of-sample realizations of uncertainty. Here, we define and demonstrate 
tractability of a similar distributionally robust model in the context of two-sided chance 
constraints. 

Let £ ~ N(p, £) be a jointly distributed Gaussian random vector with mean p and 
positive definite covariance matrix £ and 0 < e < |, and let LL T = £ be the Cholesky 
decomposition of £. 

From Lemma [4j recall 

P(a < x T l ; < b) > 1 — e (34) 

iff 

3t > ||L T a ;||2 such that (a — p T x , b — p T x , t) £ S € . (35) 

We define the distributionally robust (or ambiguous) two-sided chance constraint as: 

Vs~N(v,E)(a < x T f <b)> 1 - e V(/u, £) £ U (36) 


Lemma 8 For e < | and under the assumption that the uncertainty set decomposes by p 
and £, i.e., U = U^x Us, then the constraint (36) is tractable if we can tractably optimize 
a linear objective over the sets U( and Us- 


Proof Note that (36) is a convex constraint, because it is the intersection of (infinitely) many 
convex constraints. We will prove tractability by demonstrating that we can easily separate, 
i.e., find the worst-case p and £ given ( a,b,c ). 


We have that (36) holds iff3t s.t. 

t > 


11-He 


Llxl 


(■ a — p T x, b — p T x, t) £ S e 


M£ GUs 
V/z£ U„ 


(37) 

(38) 


Constraint (37) can be reformulated as t > ^/max^gj/r x T £x, so we can separate by op¬ 
timizing a linear objective over Us- For t > 0, the separation problem corresponding to 
constraint (38) is 


min <P{(b — p T x)/t) — <£((a — p T x)/t ), 


fL(zU l 


(39) 


which is a minimization of a log-concave function. However, observe that it is essentially a 
one-dimensional problem depending on p T x, so it can be solved by testing with the values 
min^g^ p T x and ma x m6C/(j p T x. 


5 Computational tractability of S e and S e 

We have demonstrated applications of the set S e and its conic hull S e to represent a number 
of classes of convex chance constraints. So far, we have used these sets as a theoretical tool 
in order to prove convexity. In practice, we are also interested in computationally tractable 
representations of these sets, which ideally can be used within off-the-shelf solvers. 
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5.1 Representation using convex functions 

Recall from Lemma that the function f(x,y ) = ip(t) dt = d>(y) — <P(x) is log-concave. 
Therefore the equivalence ( x,y ) £ S e iff log f(x,y) > log(l — e) provides a representation 
which can essentially be used directly within derivative-based nonlinear solvers, which expect 
constraints in the form }{x) < 0 where / is smooth and convex. Furthermore, the perspective 
function 

g{x,y,z) = z(log(<%/z) -$(x/z)) - log(l - e)) 

is concave mi, and one can see that ( x , y , z) £ S e iff g(x, y, z) > 0 and z > 0, which provides 
a potentially useful representation of S € . 

The above representations are valid for the interval e £ (0,1). For the special case of 
e £ (0, |), we note that for x > 0, d>(x) is concave, and for x < 0, <P(x) is convex. Since 
e < \ and f(x, y) = <P{y) — <&(x) > 1 — e imply x < 0 and y > 0, we note that / itself is 
concave over the domain of S e because it is a sum of two concave functions. This observation 
provides an alternative convex representation of (x, y, z) £ S e with the constraints 

z(<P(y/z) — d>(x/z) — (1 — e)) > 0 and z > 0. (40) 

With either convex representation, the derivatives are easy to compute when z > 0. However, 
derivative-based solvers may fail as z —> 0. 


5.2 Separation oracles 


A functional, derivative-based representation of S e may be directly applicable in many sit¬ 
uations, but alternative solution methods exist. For example, algorithms for convex mixed- 
integer nonlinear optimization typically make use of a combination of continuous nonlinear 
relaxations and iteratively generated polyhedral outer approximations |20l l7] . In this section 
we discuss how separation oracles could be implemented to generate such polyhedral outer 
approximations. Our focus is on developing separation oracles which lead to polyhedral ap¬ 
proximations which are “better” than the more common approach which follows from the 
functional representation of Section 5.1 in the sense of producing hyperplanes which are 
tangent to the set S e . 

In brief, when a separation oracle for the set S e is given a point (x,y), it first determines if 
(x, y) £ S e . If (x, y) ^ S e , it returns a hyperplane (a, b) £ R 1 2 xR such that aiX + a,2y > b and 
S e is contained in the halfspace defined by {( x,y ) : a\X + a,2y < b}. Hence, the hyperplane 
separates the point ( x,y ) from the set S e . 

First we note that a separation oracle for S f immediately provides a separation oracle 
for the conic hull S e . Suppose ( x,y,z ) ^ S e and z > 0. Then [x/z,y/z) ft S e so we take a 
hyperplane a\X + a, 2 y = b which separates (x/z, y/z) from S € . then the hyperplane a\X + 
02 y — bz = 0 separates (x, y , z) from S e . If z = 0, note that, assuming e < (x, y, 0) £ S e iff 

y > 0 and x < 0, so these two constraints serve as the separating hyperplanes in this case. 
Thus we restrict our discussion to separation oracles for S e . 

The most straightforward separation oracle for a convex set described by a smooth 
convex function is as follows. For any smooth, convex function /, if we are given x' with 
f(x') > 0, then the hyperplane f(x') + V/(x')( x — x ')<0 separates x ’ from the feasible set 
of {x : f(x) < 0} HQ For the case of 5 e , however, this hyperplane is weak. More specifically, 


1 Taking for granted that we can compute ^(-) efficiently, this gradient-based separation oracle may also 

be of theoretical use in proving tractability via the ellipsoid algorithm ng. 
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we have 

(x, y) G S e iff /( x, y) := 1 - e - <P(y) + F(x) < 0. (41) 

If we use this representation to separate the point (0,0), then /(0,0) = 1 — e and 
V/(0,0) = (^=,--!=), and our separating hyperplane is 

x — y < — v / 27t(1 — e). (42) 

Figure |T| shows the set S e together with this separating hyperplane. Observe that the hyper¬ 
plane is not tangent to S e , which means that it may serve poorly as an outer approximation. 



Fig. 1 In blue, the set S e , with e = 0.05. In orange, the half space corresponding to the separating hyper¬ 
plane (|42|. The hyperplane separates the point (0,0) but is not tangent to S e . 


Instead of using this hyperplane, we might consider computing the best possible sep¬ 
arating hyperplane with the same slope by evaluating max( IiS) ) £ s < x — y. By a symmetry 
argument which we omit here, this value is 2^ _1 (e/2), so we can strengthen the previous 
hyperplane to 

x-y < 2<?- 1 (e/2). (43) 

More generally, the support function (JsS a i &) = max( x>y ) g s e ax + by enables one to 
compute the best possible separating hyperplane of a given slope m- Another approach 
to generating tangent separating hyperplanes is to compute an orthogonal projection of the 
point (x, y) onto the set S e and then add a hyperplane which is tangent to the projected 
point. In the following section, we provide simple representations of the support function 
and orthogonal projection operators for the set S e . These developments may enable practical 
implementations. 


5.3 Support function of S e and orthogonal projection onto S e 
We begin with a lemma which characterizes the boundary of S e . 

Lemma 9 Let e G (0, |). Then the point ( x,y ) lies on the boundary of the set S e> i.e., 
y) — <P(x) = 1 — e iff3\£ (0,1) such that x = ^ _1 (Ae) and y = — (1 — A)e). 

Proof First, let A G (0,1). Then (1—A)e))— <P(<P~~ 1 (\e)) = (1 — (1—A)e) — Ae = 1— e, 

and so the point ( ( P~ 1 (Xe), ( P~ 1 (l — (1 — A)e)) lies on the boundary. In the other direction, 
suppose the point (x,y) is on the boundary of S e . Set A = <I(x)/e. Note that since x and 
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y are finite, we must have 0 < <L>{x) < e and 1 — e < <P(y) < 1, and hence 0 < A < 1. 
Trivially x = d>~ 1 (Xe). Then <L(y) = 1 — e + <P(x), and we see that y = <£ _1 (1 — e + Ae) = 
^-i(l_(l_A) e ). 


This result provides an explicit univariate parameterization of the boundary of S e in 
terms of A, which is quite useful for computational purposes. For example, suppose we 
wanted to minimize a function g(x,y) along the boundary of S e . Then this problem can be 
formulated as a one-dimensional search problem, 


min g(<P 1 (Xe),<P 1 (1 — (1 — Ale)). (44) 

Ae(o,i) 


The following lemma uses the formulation (44) to demonstrate that optimization of some 
linear functions over S e can be expressed as a univariate convex optimization problem. 


Lemma 10 Suppose g(x,y) = 
convex optimization problem. 


ax + by with a < 0 and b > 0. Then (44) is a smooth, strictly 


Proof Define /i(A) := g(<L> 1 (Xe),T> 1 (1 — (1 — A)e)). We explicitly calculate the derivatives 
using the following basic formulas: 


d _ _ -| , . /—— ^ 1 ( x ) 2 

(x) = V27re 2 
dx 

-fzQ-'ix) = 2 t T<P~ 1 (x)e' l, ~ 1 ^ 2 
ax z 

so 

-^U(A) = 2 7 rae 2 <l>- 1 (Ae)e^' 1 ^ 2 +2 7 r6e 2 <?- 1 (l - (1 - A^e*'^ 1 ^ 1 -^ 2 . 

CLA Z 

Note d>~ 1 (Xe) < 0 and — (1 — A)e) > 0, so given a < 0 and b > 0, we have that 

S?KX) > 0. 

Following Lemma [TU] we have an efficient way to evaluate the support function 

as (a, b) = max ax + by. 

( x,y)eS e 

Specifically, when a > 0 and b < 0, we solve a one-dimensional convex minimization problem. 
If a < 0 or b > 0, then as e (a, b) = oo. If a = 0 and b < 0, as e (a, b) = b<P(l — e). If b = 0 and 
a > 0, as e {a,b) = a<P(e). These last two cases follow from taking the limit when A = 0 and 
A = 1, respectively. 


Lemma 11 We can compute an orthogonal projection onto S e by solving a one-dimensional 
strictly convex minimization problem. 

Proof Similar to Lemma \10\ we will use the parameterization of the boundary, but solv¬ 
ing j44| over a restricted domain. Given (a, b) (jL S t , the orthogonal projection is the solution 
to ( |44| ) with g{x,y) = ^(x — a) 2 + \ {y — b) 2 . Actually we do not need to optimize over all 
X £ (0,1); note that the orthogonal projection always lies on the boundary of S e between the 
projections along the x and y axes. More specifically, we need only consider 

X £ (l- ^(1-<£(&)), i<£(a)^ , 


( 45 ) 






Two-sided linear chance constraints and extensions 


13 


and within this interval, by construction, the inequalities 

< a and ^ _1 (1 — (1 — A)e) > b 


(46) 


hold. 

Define h(X) := — a) 2 + (1 — A)e) — b) 2 . We will prove strict convexity 

of h within the domain (|45[) by showing that > q. From the chain rule (for arbitrary f), 


d?_ 1 
dx 2 2 


(/O) 


«) 2 =(£w 


+ (f(x) 


J 2 f, 1 


Discarding the squared first derivative terms, we have 

^(A) > (<£ _1 (Ae) - a)^F~\\e) + (^(l - (1 - A)e) - &)^(* _1 ( 1 - (1 


The result follows from noting that 1 (Ae) < 0 and 1 (1 — (1 — A)e) > 0 

with the inequalities (461. 


— A)e) 
combined 


5.4 An approximate polyhedral representation of S e 

In this section, we develop an approximate polyhedral representation of S e . 

Definition 4 A polyhedron P e is an outer approximation of S t if S e C P e . 

While polyhedral outer approximations are straightforward to generate, either through 
an iterative cutting-plane procedure or by preselecting a number of tangent hyperplanes, 
we are interested in outer approximations with a provable approximation guarantee, in the 
sense which we now define. 

Definition 5 A family of polyhedral outer approximations P t forms an a-approximation 
of S e ifVee (0,i], 

y) — <P(x) > 1 — ae V(cc, y) £ P e . (47) 

Or equivalently, when ae < f, S e C P e C S ae . 

We restrict e < | for notational convenience and because this is the case of direct interest, 
although many of the results here generalize for e £ (0,1). 

Note that although our development is from the perspective of outer approximation, a 
family of polyhedral outer approximations may be used to generate conservative approxi¬ 
mations as well, since if P e is an a-approximation, then 

P e/a CS e Ve€(0,^]. 

We begin with a very simple 2-approximation of S e with the axis-aligned polyhedra: 

A e = {(x,y) gM 2 : x < F~ 1 (e),y > 0 _1 (I - e)}. (48) 

For (x,y) € A e , by monotonicity of the cumulative density function <P we conclude 


d>(y) — d>(x) > (1 — e) — e > I — 2e. 


(49) 
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Fig. 2 In blue, the set S e ■ In orange, the polyhedral outer approximation A e (left) and B e (right). By 
adding a single additional inequality, we strengthen the relaxation significantly. 


This 2-approximation of S e is equivalent to representing P(a < x T £ < b) > 1 — e by using 
the two standard linear chance constraints P(a < x T £) > 1 — e and P(:c T £ < b) > 1 — e. 
Bienstock et al. [5] employ this approximation citing improved computational tractability. 

The 2-approximation model is the best one can achieve with two linear constraints in the 
following sense. The set S t has two extreme rays: (—1, 0) and (0,1), which follow from the 
fact that <P is monotonic increasing. Therefore, any outer approximation of S e must contain 
these rays. If, in addition, these are not the extreme rays of the outer approximation, then 
the approximation cannot be an a-approximation for any a, because S ae cannot contain 
the set. 

The main result of this section is that with a single additional linear constraint, one 
may improve the above 2-approxinration to a 1.25-approximation. The axis-aligned approx¬ 
imation performs poorly at the “corner” where x = 1 (e) and y = <2> _1 (1 — e). If we 
add a hyperplane to separate this point, from the previous discussion we obtain the hyper¬ 


plane (431. 


Therefore we define the family of polyhedra as 


Be ■■= {( x,y ) G 


l 2 :x< ^~\e),y > ^(l -e),x-y< 2<P~ 1 (e/2)}. 


(50) 


The family B e forms a valid outer approximation because A e is a valid family, and we’ve 
added a valid separating hyperplane. Figure [2] displays the two families of approximations 
for a fixed e. 

The following lemma simplifies the task of proving the a-approximation. 


Lemma 12 For a < 2, a family of polyhedral outer approximations P e forms an a-approximation 
ofS e iffVee( 0,§] 

1. V vertices (x,y) of P t , we have d>(y) — F(x) > 1 — ae, and 

2. the extreme rays of P e are (—1,0) and (0,1). 

That is, it is sufficient to verify the approximation quality at the vertices. 

Proof Fix e and suppose that the two above conditions hold. Then all vertices, by definition, 
are contained in the set S ae (our assumptions imply ae < 1). By convexity of S ae , this 
implies that all convex combinations of the vertices of P e are contained in S ae . All elements 
of the polyhedron P e can be represented as a convex combination of its vertices plus a conic 
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combination of its extreme rays. Since the extreme rays (—1,0) and (0,1) are also extreme 
rays of S at , it follows that P t C S ae . 

The vertices of B e are (^ _1 (e), ^ _1 (e) — 2<L~ 1 (e/2)) and {2d>~ 1 {e/2)-\-d>~ 1 (l — e), ^ -1 (1 — 
e)). By the identities ^ _1 (1 —e) = — <£ _1 (e) and #(— x) = 1 — we see that these vertices 
are in fact symmetric, so it is sufficient to consider only one of them. 

We first establish a simple bound that does not use any deep properties of the Gaussian 
distribution. 


Lemma 13 The “three-cut” family of outer approximations B e forms a 1.5-approximation 
of S,. 

Proof Consider the vertex (T > ~ 1 (e) 1 <P~ 1 (e) — 2<P~ 1 (e/2)). It is sufficient to show that it is 
contained in the set S/ 5e . 

#(# _1 (e) - 2<P~ 1 (e/2)) - <2>(0 _1 (e)) = $(($ _1 (e) - <Z> _1 (e/2)) - $~ 1 {e/2)) - e (51) 


> 0(-$ -1 (e/2)) - e 
= $($ _1 (l-e/2))-e 
= 1 - 1.5e, 

where the inequality follows from ^ _1 (e) > T’~ 1 (e/2) and monotonicity of T>. 


(52) 

(53) 

(54) 


We can improve this bound by using properties of the Gaussian distribution. Lemmas |14| 
and |i~5| below develop the necessary properties, and Theorem [3] states the final result. 

Lemma 14 — |) — ^ -1 (1 — e) > ^ -1 (1 — |) — — |) for e £ (0, \\. 

Proof Let /(e) = <P~ 1 ( 1 — |) —^ _1 (1 —e). Then we intend to show /(e) > /(e/2) Ve £ (0, \]. 
It suffices to show that f is monotonic increasing over the interval. 

Recalling 


—T’ x (x) = Vrjrexp 
ax 


1 (xf 


have 


/'(e) = --V^exp 


(P-^I- |) 2 
2 


+ V2tt 


exp 


^ _1 (1 — e) 2 
2 


We will show that f is always positive for e £ (0 , At e = ^, <P 1 (|) = 0, so 

2 


f'(x\ = V2n — xV2tt i 


exp U*- 1 - /2 « 0.93 > 0. 


Suppose, for contradiction , /'(e') = 0 for some e'. Then 

- e') 2 


exp 


1 ($- 1 ( 1 - 
= X exp -„— 


which implies 


^ _1 (1 — e') 2 = —2log(2) + ( 1 — 7 T 


2 


(55) 
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Note that g(e) := <P~ 1 ( 1 — e ) 2 is strictly convex for e £ (0,1) by examination of the 
second derivative. This means that (/(e) is strictly monotonic increasing. We’re looking for 
a solution to g(e/ 2) — g(e) = 21og(2). Note that (/(e/2) — g(e) is strictly decreasing over the 
interval because (1/2)< 7 '(e/2) — (/(e) < 0. One can verify the limit 


lim <P 1 
£->0 + 



<£ _1 (1 — e ) 2 


2 log( 2 ), 


which implies in fact that there can be no solution to (551. This proves our original claim. 
Lemma 15 < T~ 1 (e) — 2<£ _1 (|) > ^ -1 (1 — |) for e € (0, \) 

Proof Applying Lemma [7/1 we have: 


= ^(l - \) + (^(l - ^ e)) 

= ^( 1 - 7 ) 


(56) 

(57) 

(58) 


Theorem 3 The “three-cut” family of outer approximations B t forms a 1.25-approximation 
ofS,. 

Proof Consider the vertex (^ _ 1 (e), T’~ 1 (e) — 2<£ _ 1 (e/2)). It is sufficient to show that it is 
contained in the set Si. 25 e- 


<P(<P~ 1 (e) - 2<F- 1 (e/2)) - <?(^ _ 1 (e)) = 

> - e/4)) - e 

= 1 - 1.25e, 


where the inequality follows from Lemma 15 


(59) 

(60) 
(61) 


With an additionally highly technical argument which we omit for brevity, it is possible 
to show that the 1.25 value is tight; that is, the “three-cut “ family of outer approximations 
B e is not an a-approximation for any a < 1.25. 

We summarize the results of this section with a succinct statement of an SOC outer 
approximation of the two-sided chance constraint based on B e . 


Lemma 16 Let £ ~ iV(/x, E) be a jointly distributed Gaussian random vector with mean g 
and positive definite covariance matrix E and 0 < e < Let LL T = E be the Cholesky 
decomposition of E. The following extended formulation, with the additional variable t, 

t>\\L T x\\ 2 , (62) 

a — g T x < T>~ 1 (e)t, (63) 

b — g T x > — e)t, (64) 

a — b < 2<L>~ 1 (e/2)t. (65) 

is an SOC outer approximation of the constraint 

P(ct < x T f < b) > 1 - e 

which in fact guarantees 

P(a < x T f < b) > 1 - 1.25e. 
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Proof From Lemma [^j 

iff 


P(a < x T £ < b) > 1 — . 


3t > ||L T a ;||2 such that (a — /i T x, b — p T x, t) £ S e . 


( 66 ) 


(67) 


We take the conic hull of the polyhedral representation B e (50) of S e in order to represent 

s e . 


6 Approximation of quadratic chance constraints 

Having extensively discussed the tractability of the two-sided chance constraint model and 
its extensions to represent more complex nonlinear chance constraints exactly, we return to 
our original motivation as discussed in Section [2] In this section, we will investigate the use 
of two-sided chance constraints to approximately represent a family of challenging quadratic 
chance constraints. These sets are of the form, 

H e = {(a, 6, c, d, k) eK"xRxl"xRxI: P((a T £ + bf + ( c T f + d ) 2 < k) > 1 - e} , 

(68) 

where a and c, and &, d, and k are (vector and scalar, resp.) decision variables and £ follows 
a multivariate Gaussian distribution with known mean and covariance matrix. 


6.1 Convexity of the quadratic chance constraint 


We are unaware of any existing results on the convexity of the set H e (68). We present here 
a proof of nonconvexity for the case of e = 0.455. The counterexample, while not as strong 
as a proof of nonconvexity for all e £ (0, g], suggests that convexity is, at the least, not a 
simple extension of existing results such as those presented in Section [4] which hold for all 
e £ (0, |]. We leave the question of convexity of H e over the full range of e for future work. 
Nevertheless, we take this counterexample as a justification for seeking tractable, convex 
approximations of H e in subsequent sections. 

Consider the constraint 


P(0r£i) 2 + (x/6) 2 <1)>1 — 


(69) 


where £i and £2 are independent, standard Gaussian random variables. The constraint (69) 


is a special case of (681 with £ = (£ 1 , £2), a = (x, 0), b = 0, c = (0, y), d = 0, and k = 1. 

Figure [3] traces the value of the left-hand side of (69) along the line y = —x + 1.6. We 
see that the upper level sets of the function f(x, y) = P((x£i) 2 + (yff) 2 < 1) are not convex. 
In particular, the points (0.6,1.0) and (1.0, 0.6) belong to Ido. 455 (by numerical integration 
with reported error bounds of 10~ 7 ) but the point (0.8, 0.8), their average, does not. We 
can evaluate P((0.8£i) 2 + (0.8£2) 2 < 1) more explicitly as F X2 (1/0.8) ~ 0.542 where F X2 is 
the cumulative distribution function of the chi distribution with two degrees of freedom. 
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>, °- 55 ° 



X 


Fig. 3 On the vertical axis, the value of the left-hand side of ( |69[ ) evaluated at the point (x, —x + 1.6) by 
numerical integration (with approximate error bounds of 10 — '). We see that the set of points along this line 
that satisfy the quadratic chance constraint with probability 0.545 or greater, for example, is not convex. 
This proves nonconvexity of H t with e = 0.455. 


6.2 Approximation using two-sided constraints 


We propose an approximation of the quadratic chance constraint (681 by two absolute value 


constraints, essentially splitting up the squared terms into separate constraints. We use 
the union bound to enforce a conservative approximation, hence we introduce a parameter 
P £ (0,1) to balance the trade-off between violations in the two separate constraints. In 
Lemma ITT] below, we state this formulation formally and prove that it is a valid convex, 


conservative approximation of (681 in an extended set of variables. 


Lemma 17 Fix e < \, fix P £ (0,1), and let 


G r .„ = 


| (a, b, c, d, k, A, f 2 ) £ R 2n+5 :P(|a T e + &| < A) > 1 - /?e 

H\c T (i + d\<f 2 )>l-(l-p)e 

fi +fl<k 


(70a) 

(70b) 

(70c) 


Let G pr p J be the projection of the set G et p onto the variables ( a,b,c,d,k). Then G pr p 3 is 
convex and G pr P 3 C H e . That is, the set G pr P 3 is a conservative, convex approximation of 


the quadratic chance constraint (68). 


Proof Let (a, b , c, d, k , fi, f 2 ) £ G e p 3 . To simplify the proof, let x = + b and if = c T f + d 

be random variables. Then y 2 < / 2 and if 2 < /f implies y 2 + ip 2 < k, which gives us the 
inequality 

P(X 2 + V* 2 < k) > P( X 2 < fl and if 2 < f 2 ). (71) 

From the union bound, 


P(X 2 < fl and ^ < f 2 ) > P( X 2 < f 2 ) + F(if 2 < f 2 ) - 1 

> 1 - pe + 1 - (1 - p)e - 1 

= 1 - e, 


(72) 

(73) 

(74) 
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which proves C H t . Convexity of G e ,p (and therefore G^™ 3 ) follows from Lemma 

and the fact that fi+f^Gikisa convex quadratic constraint. 


Replacing constraints (70a) and (70b) with the outer approximation (62)-(651, one ob¬ 
tains an SOC-representable approximation of H e . Note that this approximation is no longer 
conservative, but can be made so by instead using the polyhedral conservative approximation 
of S f as discussed in Section 15.41 


6.3 Approximation via robust optimization 


An alternative conservative approximation which we consider is based on robust optimiza¬ 
tion [2j. 


Lemma 18 Let e £ (0,1) and suppose that £ follows an n-dimensional multivariate Guas- 
sian distribution. Without loss of generality, we assume each component is independent 
standard Gaussian with zero mean and unit variance. Let r = F~ 3 (l — e), where Pr 1 is the 
inverse cumulative distribution function of the chi distribution with n degrees of freedom. 
Let 


r 

XI a c 

1 

. (a, b , c, d, k , A) : 

k _ AP 2 bd 

j 

a T b 1 

1 

c T d 1 

J 


(75) 


where the notation A >z 0 means that the symmetric matrix A is positive 
blank entries represent zero blocks. Let RP ro3 be the projection of the set R e 
(a, b , c, d , k). Then RP r °i is convex and Pf ro C H e . That is, the set Pf™- 7 
convex approximation of the chance constraint (68). 


semidefinite, and 
onto the variables 
is a conservative, 


Proof R is sufficient to show that if (a, b , c, d , k, A) £ R e , then there exists a set U such that 
P(£ £ U) > 1 — e such that 


(a T ry + b ) 2 + ( c T r] + d) 2 < k, V q £ U. 


(76) 


Instead of allowing U to vary for any point in the set, which is equivalent to the original 
chance constraint, we fix U = {q : ||^||2 < P} and therefore obtain a conservative approx¬ 
imation. By the definition of the chi distribution, P(£ £ U) = 1 — e. In the terminology 
of robust optimization, U is an uncertainty set. It is a standard result, which follows from 
the S-lemma and Schur complement lemmas, that R v e ro • 7 is precisely the set of points sat¬ 
isfying (76) for this choice of the uncertainty set U fffj. Convexity follows since R e is the 
set of points satisfying a linear matrix inequality (LMI), which is tractable by semidefinite 
programming (SDP). 


Note that the choice of P = F~ 3 (1 — e) may be overly conservative, especially when n 
is large, although we are not aware of any theoretical guidance on choosing a smaller value 
of P such that the chance constraint remains satisfied. 
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6.4 Nemirovski-Shapiro CVaR approximation 


The third approximation we consider is based on the so-called CVaR approximation pro¬ 
posed by Nemirovski and Shapiro na. Let I{z) be the indicator function of the interval 
[0, oo), i.e., I(z) = 1 if z > 0 and I(z) = 0 otherwise. We can rewrite the quadratic chance 
constraint in the following equivalent expected-value form: 

E ? [I((a r £ + b) 2 + {c T f + df - k)] < e. (77) 

Nemirovski and Shapiro propose to upper bound the indicator function I with the convex 
increasing function ip(z) = max(l + z, 0), which, up to rescaling (z —> z/a for some a), is 
the best possible convex upper bound on the indicator function in the sense that if u>(z) is 
another convex increasing upper bound, then there exists a > 0 such that ip(z/a) < ui(z) 
for all z £ R. Then the constraint 

inf [E| [?/>(((a T f + b ) 2 + (c T £ + d ) 2 - k)/a)] — e] <0, (78) 

is a conservative approximation of the quadratic chance constraint ([77} which is furthermore 
convex in (a, b, c, d , k), which motivates the following lemma. 

Lemma 19 Let e £ (0,1) and suppose that f follows an n-dimensional multivariate Guas- 
sian distribution. Let 


NS e = {(a, b, c, d, k, a) : Ej [max((a T £ + b) 2 + ( c T l ; + d) 2 — k + a, 0)] < ae, a > 0} (79) 

Let NSf r ° : ’ be the projection of the set NS e onto the variables (a, b , c, d, k). Then NSf ro is 
convex and NS C H e . That is, the set NSf ro ^ is a conservative, convex approximation 
of the chance constraint (68). 

Proof See m- 


6.5 A comparison of approximations 

We have presented three convex, conservative formulations of the quadratic chance con¬ 
straint: one based on two-sided chance constraints, one based on robust optimization, 
and one based on convex approximation of the indicator function. All three have differ¬ 
ent tractability properties. In order of increasing computational difficulty, the two-sided 
approximation can be implemented, with small additional approximation error, by second- 
order cone programming (SOCP) following the developments presented in this work. The 
approximation based on robust optimization has an SDP formulation which may not be 
practical on large-scale problems, although we note the work of jl] where specialized meth¬ 
ods were developed to exploit the block structure. The CVaR approximation is the most 
computationally challenging; it has no known reformulation in terms of standard problem 
classes and requires multidimensional integration to evaluate. 

One might expect that the more computationally challenging approaches could yield 
tighter approximations. In this section, we examine a two-dimensional example in order to 
gain some understanding of the relative strengths of the approximations. We find, perhaps 
surprisingly, that no one approximation strictly dominates another. Hence, the two-sided 
approximation we propose has value in both its strength and ease of implementation. 
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As an example we will recall the simple case of 


P(Ki) 2 + (y6) 2 < i) > i - e, 


(80) 


where and £2 are independent, standard Gaussian random variables. 

Note that 11(^ 1 ,^ 2 )1 12 follows the chi distribution with 2 degrees of freedom, so in the 
robust approximation we can pick the uncertainty set U = { (771 , rj 2 ) : 11 (^7i , ^72) 1 12 < — 

e)} where F~^ is the inverse cumulative distribution function of the chi distribution with 


two degrees of freedom. 

In this example, (75) reduces to 


A x 

A y 

1 - A r 2 

x 1 

y 1 


>- 0. 


(81) 


By a Schur complement argument, the matrix ( |81[ ) is positive semidefinite iff 1 — AT 2 > 
0, A — x 2 > 0, and A — y 2 > 0, which holds iff a; € [— 1/T, 1/C] and y € [— 1/C, 1/C], a simple 
box constraint. 

An interesting observation is that for n = 2, the choice of C = F~ 2 (l — e) is minimal in 
the sense that any smaller value no longer corresponds to a conservative approximation of 


the chance constraint (801: 


p(((i/ c)£i) 2 + ((i/c)6) 2 < 1 ) = P(£ + C 2 2 < r 2 ) = F X 2 (r). 


(82) 


In other words, the robust approximation to (801 touches the boundary of the exact feasible 


set at the corners of the box. This observation eliminates the possibility of relaxing the 
overconservatism of the robust approximation by decreasing the size of the uncertainty set 
for the case of n = 2. 


A point is feasible to the two-sided approximation ( |70] ) for f) = A iff 3 /i,/ 2 such that 


fi + H ^ 1) P(|a;£i| < / 1 ) > 1 — |, and P(|y^ 2 | < / 2 ) > 1 — f • By symmetry, these two 
chance constraints hold iff /i/|cc| > C _1 ( 1 — |) and / 2 /1 2 /1 > <£ -1 (l — |). Therefore, the 
point (x,y) feasible to the two-sided approximation iff 


x 2 + y 2 < 


1 


a simple ball constraint. 

The CVaR approximation, to our knowledge, does not yield a closed-form algebraic rep¬ 
resentation, although in this simple case we are able to evaluate it by numerical integration. 

Figures |4] and [5] compare the three approximations with the exact feasible set for e = 0.5 
and e = 0.05, respectively. For e = 0.5, both the robust and the two-sided approximations 
dominate the CVaR approximation. For e = 0.05, no approximation is a strict subset of 
another. Curiously, for this particular case the exact set -ffo .05 appears to be convex. 
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Fig. 4 Outlined in black, the exact nonconvex feasible set ( x,y ) satisfying P((rc£i) 2 + (y£ 2 ) 2 < 1) > 1 — e 
for e = 0.5. We compare the three different convex approximations. 



.6 i i i i i 

- 0.6 - 0.4 - 0.2 0.0 0.2 0.4 


0 . 6 , 



Fig. 5 Outlined in black, the exact (seemingly convex) feasible set (a ;,y) satisfying P((x£i) 2 + < 

1) > 1 — e for c = 0.05. We compare the three different convex approximations. On the right, a zoomed-in 
view of the top-right corner shows that no approximation strictly dominates another. 


7 Conclusion 

Building on top of the basic convexity result for two-sided chance constraints developed 
in Section [3j we have shown, perhaps surprisingly, that a large class of more general non¬ 
linear chance constraints is in fact convex (Theorem [2]). In addition, our analysis of the 
computational tractability of the two-sided chance constraint, and in particular the poly¬ 
hedral approximation of the set S e with provable approximation quality, develops practical 
methodologies which we believe are novel in the chance constraint literature. Finally, we 
have demonstrated that the two-sided chance constraint yields a useful approximation of 
the quadratic chance constraint which originally motivated this work. 
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We believe that our convexity results in Section [3] can be easily extended to elliptical 
log-concave distributions following M- Extensions to more general distributions are not at 
all obvious, although the distributionally robust model in Section |4~2| may serve as a useful 
approximation. The conditions under which the quadratic chance constraint set H e is convex 
is left as an open question, although based on our computational experiments we conjecture 
that the set is convex for e sufficiently small. 
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